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ABSTRACT 


The  objective  of  this  research  is  to  develop  a preprocessing  technique  for 
decomposing  pictorial  data  into  its  topological  units  and  subunits  so  as  to 
enhance  any  subsequent  feature/primitive  extraction.  To  enhance  the  later 
stages  of  perception  implies  that  each  topological  unit  and  its  subunits  must 
be  labeled  in  such  a way  as  to  reveal  their  relationship  to  the  other  topological 
units  of  the  pattern.  If  the  labeling  can  be  standardized  then  the  feature/ 
primitive  a'.dlysis,  which  is  very  application-oriented,  can  process  the  results 
of  the  preprocessor  decomposition  as  desired.  While  the  proposal  for  such  a 
decomposition  (called  segmentation  or  region  analysis)  is  not  new,  the  concept 
of  decomposition  to  reveal  a pattern's  intrinsic  structure  prior  to  and  inde- 
pendent of  feature/primitive  analysis  is  new. 
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1 . 1 INTRODUCTION 


An  accepted,  generalized  model  of  a pattern  recognition  system  is  shown 
in  Figure  1.  Classically,  the  transducer  as  the  input  device  senses  the  environment 
to  generate  a pattern;  the  preprocessor  readies  the  pattern  for  analysis  (e.g., 
noise  removal);  the  extractor  analyzes  the  pattern,  extracting  information 
which  will  enable  the  classifier  to  decide  to  what  class  of  known  patterns  the 
input  pattern  belongs. 

There  exists  both  neurophysical  evidence  in  certain  animals  and  psychological 
evidence  in  humans  that  indicate  that  feature/primitive  extraction  is  part  of  the 
recognition  process  (1) . However  there  also  exists  strong  psychological  evidence 
that  the  feature/primitive  processing  alone  is  insufficient  to  account  for  the 
phenomenon  of  pattern  recognition  (2).  Such  evidence  suggests  the  need  for  a 
pre-analytic  process  to  take  place  before  the  feature/primitive  analysis  begins. 

Such  a preprocessor  task  deals  with  determining  the  figure-ground  context  of  a 
pattern.  This  task  has  been  referred  to  by  different  psychologists  as  the  perception 
of  the  figural  unity  (3),  the  perception  of  the  generic  object  (4),  and  the 
perception  of  the  topological  object  (5).  This  proposal  is  in  contrast  to  the 
strictly  feature/primitive  viewpoint  which  essentially  Ignores  the  context  of  a 
pattern,  even  though  such  a context  has  been  recognized  as  an  important  part  of 
perception  (6).  However,  as  pointed  out  by  Deutsch  (7),  this  proposal  should  not 
be  confused  with  the  Gestalt  approach  of  template  matching.  The  Gestalt  viewpoint 
contends  that  perception  of  a figure  in  a pattern  is  not  a figure  in  isolation, 
but  a figure  set  upon  a ground  (context)  and  that  such  perception  is  the  only 
level  of  analysis  needed,  i.e.,  feature/primitive  analysis  is  not  necessary. 

Thus,  it  appears,  there  is  justification  and  need  for  a preprocessor  stage  that 
decomposes  a pattern  into  units  which  reveal  its  figure-context  relationship. 

As  indicated  by  Piaget  (5)  such  a decomposition  would  be  topological  in  nature  as 
opposed  to  projective-Euclidean  in  nature.  That  is,  the  decomposition  should  not 
depend  on  concepts  such  as  size,  angle,  distance,  parallelism,  etc.;  but  instead 
on  the  concepts  of  adjacency,  neighborhood,  connectedness,  dissection  of  an 
object,  etc.  (8). 

As  a practical  motivation  for  such  a decomposition,  consider  the  following 
proposal  by  Breeding  (9).  Breeding  has  concluded  that  while  impressive  results 
have  been  obtained  for  the  automatic  identification  of  aircraft  from  television 
patterns  using  moment  invariants  (10),  such  a technique  employs  global  features 
and  is  susceptible  to  the  absence  of  pictorial  data.  In  particular,  if  a subunit 
(e.g.,  wing)  of  a topological  unit  (e.g.,  aircraft)  is  obstructed  because  of 
cloud  cover,  camouflage,  shadows,  etc.,  then  misclassif ication  usually  results. 

Thus  it  would  be  advantageous  to  investigate  methods  of  decomposing  a topological 
unit  into  the  subunits  which  reveal  the  intrinsic  structure  of  the  topological 
unit.  These  subunits  would  then  be  considered  as  the  input  to  the  feature/ 
primitive  extraction  stage.  Such  . decomposition  technique  should  not  limit 
the  invariance  of  the  subsequent  recognition  technique  and  should  be  amenable  to 
a braod  spectrum  of  pictorial  patterns. 

With  these  ideas  in  mind,  the  objective  of  this  research  was  to  develop  a 
preprocessing  technique  for  decomposing  pictorial  data  into  its  topological  units 
and  subunits  so  as  to  enhance  any  subsequent  feature/primitive  extraction.  To 
enhance  the  later  stages  of  perception  implies  that  each  topological  unit  and  its 
subunits  must  be  labelled  in  such  a way  as  to  reveal  their  relationship  to  the 
other  topological  units  of  the  pattern.  If  the  labelling  can  be  standardized  then 
the  feature/primitive  analysis,  which  is  very  application-oriented,  can  process 
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the  results  of  the  preprocessor  decomposition  as  desired.  While  the  proposal  for 
such  a decomposition  step  (called  segmentation  or  region  analysis)  is  not  new, 
the  concept  of  decomposition  to  reveal  a pattern's  intrinsic  structure  prior  to  and 
independent  of  feature/primitive  analysis  is  new. 

To  accomplish  this  objective  the  following  questions  must  be  answered: 

1)  How  does  one  define  the  topological  units  of  a pattern? 

2)  How  does  one  label  these  topological  units? 

3)  How  does  one  define  the  subunits  of  a topological  unit? 

4)  How  does  one  label  these  subunits? 


2.1  CONNECT  RELATION 


li/hat  kind  of  a relation  would  best  define  the  topological  units  of  a pattern? 
The  topology  of  a pattern  can  be  represented  in  many  different  formalisms,  e.g., 
adjacent  cells  of  a discrete  plane,  neighboring  automata  of  an  array  processor, 
connected  vertices  of  a network  grid,  etc.  To  express  all  these,  a generalization 
of  the  notion  of  connectivity  is  needed.  Let  X and  Y be  denumerable  sets,  then 
define  connect<Y//X>  as  the  subset  of  all  elements  of  X which  are  adjacent  to 
the  elements  connect^<Y//X>  = YflX.  If  connectO<Y//X>  = $,  then  connect<Y//X>  = <f. 
The  second  argument  preceded  by  a //  symbol  denotes  the  restriction  of  Y to  X. 
Elements  adjacent  to  adjacent  elements  can  be  represented  by  a recursion  of 
connect  (see  Figure  2). 

2 

connect  <Y//X>  = connect< (connect<Y//X>)//X> 

3 

connect  <Y//X>  = connect < (connect < (connect <Y//X>) //X> )//X> 

The  recursive  closure  of  connect  is  given  by 


connect<Y//X>  = connect  <Y//X> 


where  n is  such  that 

connecjtn<Y//X>  = connectn+^<Y//X> . 

A set  X is  called  a connected  set  if  Vx^eX,  connect<x^ / /X>  = X.  Note  that  since 

the  concept  of  adjacency  is  undirected,  connect  is  an  equivalence  relation  and  thus 
can  partition  a set. 
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3.1  PICTURES 


3.1.1  Picture  as  Input 

Physically,  an  input  pattern  will  be  a two-dimensional,  binary-valued 
array.  Such  an  input  is  practical  to  obtain  when  a TV  camera  is  used  as  the 
transducer  (11).  If  the  TV  system  is  equipped  with  variable  threshold  and  color 
variables  (12),  the  real-world  environment  can  be  approximated  by  a set  of 
parameter  dependent  binary-valued  patterns. 


Mathematically,  an  input  pattern  will  be  considered  as  a regular  tessellation 
of  the  plane  into  binary-valued  cells.  For  a plane,  there  are  only  three  possible 
regular  tessellations:  triangular,  square,  and  hexagonal.  On  each  such  tes- 
sellation various  indices  can  be  defined  such  as  forward  raster,  backward  raster, 
diagonal,  etc.  These  indices  allow  one  to  talk  about  the  i1*1  cell  of  a given 
tessellation.  A fixed-size,  connected  set  P of  these  cells  will  be  called  picture 
cells.  This  set  P will  be  bounded  by  a reachable  set  B called  background  cells. 
The  values  of  the  background  cells  do  not  vary  and  are  a priori.  The  values  of 
the  picture  cells  do  vary  and  are  given  by  the  discrete  binary-valued  function 
pict<i>  where  i represents  the  index  of  the  iC  cell  for  a given  tessellation. 

3.1.2  Topological  Units  of  a Picture 

Using  pict  as  the  characteristic  function  of  a set,  the  following 
topological  subsets  of  a picture  can  be  defined.  The  images  set  I is 

{j  : pict<1>  = b;  j e P) 
where  b may  equal  either  0 or  1 but  not  both. 

The  ground  set_I  is  defined  as  the  set  of  all  picture  cells  which  are  not 
image  cells,  I.e.,  I = P-I.  The  surrounds  ret  D is 

{j  : BCconnect<j//lUB> ; j e I }. 


Thus  D is  the  subset  of  all  ground  cells  that  are  connected  to  the  background  B. 

The  holes  set  H is  the  subset  of  all  ground  cells  not  connected  to  the  background 

i.e.,  H = I-D.  From  these  definitions  it  follows  that  these  three  topological 

subsets  are  pariwise  disjoint  and  thus  form  a partition  of  the  picture  set  P, 

and  that  for  picture  P = I UH  UD  . 

a a a a 


The  basic  topological  unit  of  a picture  will  be  a region.  A region,  R, 

is  any  connected  subset  of  a picture.  Since  a region  is  a connected  set,  the 

following  definitions  can  be  made.  A cell  jcR  is  a border  cell  if  connect< j //PUB>iR, 
i.e.,  not  all  the  cells  adjacent  to  j are  in  R (see  Figure  3).  The  border  cells 
of  a region  are  partitioned  into  contours  = connect< j //X>  where  jeX  = {border 

cells}.  Thus  a contour  of  a region  is  a connected  set  of  border  cells,  ^CcR.  A 

simply-connected  region  is  a region  that  has  only  one  contour  (see  Figure  4).  A 

multiply-connected  region  is  a region  that  has  two  or  more  contours  (see  Figure  5). 


The  sets  I,  D,  and  H can  be  partitioned  into  topological  regions  called 
respectively,  images,  surrounds,  and  holes.  Each  image  ^ = conflect<i//I> ; each 

surround  jD  - connect<j//D> ; and  each  hole  » connect<k//H>  (see  Figure  6). 


I 
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From  these  topological  regions  a pseudo  region,  called  a collage,  can  be 
defined.  The  collage  set  K is  the  set  of  all  collages.  A collage  is  fabricated 
by  combining  topological  regions  and  is  useful  for  decomposing  regions  nested 
within  regions  (see  Figure  7).  A collage  may  or  may  not  be  simply-connected. 

3.1.3  Print  of  a Picture 


If  one  agrees  to  talk  about  picture  subsets  only,  then  P can  be  considered 
as  the  universal  set.  As  a result,  one  can  agree  that  i^X  implies  ieP-X  since 
X£P.  Define  the  print  of  a picture  subset  X,  denoted  ^P,  as 


pict,c<i> 


f 1 ieX 
| 0 i^X. 


This  leading  subscript  notation  is  used  to  differentiate  prints  from  the  usual 
following  subscript  notation  for  subsets.  The  print  of  a single  cell  j,  denoted 
. P,  is  just 

£i£t.<i>  = 


1 = J 

1 =1=  j- 


The  relationship  between  a print  and  a picture  is  given  by  the  develop  relation. 


jP  ■=  develop<P> . 

That  is,  develop  operates  on  the  binary  values  of  a picture  such  that  the  image 
cells  are  labelled  1 and  the  ground  cells  are  labelled  0.  To  do  this  labelling 
it  is  necessary  to  have  a priori  knowledge  about  how  the  TV  system  assigns 
binary  values.  The  importance  of  the  develop  relation  lies  in  the  fact  that  one 
can  work  with  prints  knowing  that  they  can  always  be  found  for  any  picture. 


Because  pictures  are  fixed-size  sets  it  makes  no  sense  to  talk  about  P 

a 

being  a subset  of  P^.  However,  two  pictures  are  said  to  be  equal  if  T P = ^ P, 

a b 

and  not  equal  if  P ^ P. 

a b 


3.1.4  Print  Operations 


For  a given  tessellation  and  a given  index  one  can  define  a print  operation 
between  the  cells  of  one  print  and  the  cells  of  another  print  to  produce  a new 
print.  For  example,  print  operations  analogous  to  Boolean  operations  can  be 
defined  since  each  cell  takes  on  only  two  values.  Define 


5S4<XP’YP>  = zp 


where  Z = {j  ; pict^< j > & picty<j > = 1}  and  & is  the  Boolean  and. 
Define 


~<XP,YP>  = ZP 


where  Y = {j  : pict„<j>  | pict„<j>  = 1}  and 

A Y 


is  the  Boolean  or. 
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Define 


comp<xP>  = 

where  Y = {j  : pict  <j>  = 0}.  Also  a generalized  picture  and  and  or  can  be 
defined. 

n m m,n 

it  P = and<v  P,and<  P,...,and<  P,  P>...»,  also  t t P,  u P 

n n-1  21  j ] J i 

n m m,n 

' x P = or<x  P,or<x  P,...,o£<x  P,  P>...»,  also  v ^P , V x P 

i i n n-1  2 1 j j i,j  j i 

Likewise,  one  can  define  a picture  operation  on  the  cells  of  an  existing 
picture  to  produce  a new  picture.  Define 


index<  P>  = ,P 
X j 

where  j is  the  first  index  jeX.  Define 


reach<YP//xP>  = ^ 

where  Z = connect<Y//X> . Similar  to  before,  reach  <„P//„P>  = „P  where 

0 Y X Z 

Z = X lY,  in  other  words  reach  '-'yP//xP>  - and<  P,  ^P> . Define 

remain<„P//„P>  = and<„P,comp<reach<„P//„P>>> . 

I A A ' Y X 

The  remain  operation  can  be  considered  as  a complement  to  the  reach  operation. 
That  is,  while  the  reach  operation  produces  a print  of  all  cells  extracted  from 
an  existing  print,  the  remain  operation  produces  a print  of  all  cells  remaining 
in  the  existing  print  after  a reach  extraction  (see  Figure  8). 


To  show  how  these  picture  operations  can  be  used,  consider  the  recurring 
problem  of  extracting  the  next  region  with  index  greater  than  k from  the  subset 
X.  Assume  the  next  such  region  has  index  j , then  in  terms  of  prints 

k 

reach<index<remain<  v ,P//„P>>//„P> . 

~ .IX  X 


k 

remain<  T .P//VP  will  produce  a print  P that  has  all  regions  of  index  k or 

1 1 A Z 

less  removed  from  it.  The  index<^,P>  must  then  be  ^P  by  assumption,  resulting 

in  reach<.P//„P>,  the  desired  result. 

J a 
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4.1  SUBREGION  RELATION 


What  kind  of  a relationship  would  best  reveal  the  intrinsic  structure  of  a 
pattern?  Since  the  topological  units  and  subunits  are  being  generated  by  a 
decomposition,  a hierarchical  type  relation  is  implied.  That  is  to  say, 
a decomposition  implies  that  the  topological  units  and  subunits  are  related  such 
that  they  recursively  fit  together  to  build  up  the  whole  pattern.  Thus  it  is 
assumed  that  the  choice  of  a hierarchical  relation  (partial  order  relation  (14)) 
for  labelling  the  topological  units  will  best  reveal  the  intrinsic  structure  of 
the  pattern  and  hence  enhance  any  subsequent  perception  processing. 

Since  a region  has  been  chosen  as  the  basic  topological  unit  (as  opposed 
to  a subset),  a subregion  relation  was  chosen  as  the  hierarchical  relation. 

This  partial  order  relation  was  defined  as  ,R<.R  if  .RC.R  and  ,R<.R  if  . RC . R . 

i — J l—J  i J l J 

While  conceptually  simple,  this  relation  has  proven  effective  for  two  reasons. 
First,  because  of  the  way  topological  regions  of  a picture  are  spatially 

represented,  if  .KT.R  ^ $,  then  either  ,R<.K  or  .R=.K.  Second,  because  the 
1 J 1 i J l 

subregion  relation  can  be  applied  to  region  decomposition  also,  a unified  theory 
can  be  presented  for  topological  decomposition. 


5.1  PICTURE  DECOMPOSITION 


A picture  in  its  simplest  form  has  only  a multiply-connected  surround  and 
a simply-connected  image.  It  is  the  presence  of  a multiply-connected  image  that 
introduces  holes  and  nested  images.  However  such  multiply-connected  regions 
can  be  changed  to  a pseudo  simply-connected  region  by  creating  a collage.  Such 
a collage  would  consist  of  the  multiply-connected  region  plus  all  other  topolog- 
ical regions  "inside"  the  multiply-connected  region.  Thus  any  picture  can  be 
decomposed  into  its  topological  regions  (I,  .D,  ^K)  by  representing  it  as  a 
sequence  of  collages.  Each  collage  will  fee  decomposed  into  its  "outer"  topological 
region  while  the  "inner"  topological  regions  will  form  a new  collage.  The 
decomposition  stops  when  the  new  collage  is  a single  topological  region,  and 
hence  can  not  be  segmented  anymore. 


Define  the  surround<„P>  as 
A 


u 


Bcreach  i//(P-X)UB>:  ieP-X}. 


Now  given  the  print  ^P  of  any  picture  the  following  prints  can  be  derived 

(1)  pP  = surround<^P>  ^ 

(2)  pP  = reach<index< remain*^  ^P // ^P>> // pP> 

j 1 

(3)  ,.P  = comp<surround<TP>> . 

tv.  J. 

Each  ^D  represents  a surround  and  K represents  the  collages  set.  Some  of  the 

collages  of  P represent  simply-connected  images,  and  others  represent  mutliply- 
K. 

connected  images  with  holes  and  other  images  "inside"  them.  The  mutliply- 
connected  images  are  extracted  as  part  of  a collage. 

k 

(4)  ,,P  = reach<index<remain< T _.P / / „P>> / / „P> 

. K —————  1 i K K 

j 

and  then  segmented  by 

(5)  TP  = and<surround<and<comp<TP> , „P>>,  ,,P>. 

. I ' 1 , K , K 

j 3 J 

A new  collage  can  now  be  derived  from  each  exisitng  collage  minus  the 
multiply-connected  image  by 


(6)  ,,P  = and<comp<surround<and<comp<v.P> , „P>>>,  „P>. 

~ X j N , N 


Some  of  the  collages  of  (6)  represent  simply-connected  holes  and  others  represent 
multiply-connected  holes  with  images  and  other  holes  "inside"  them.  The  multiply- 
connected  holes  are  extracted  as  part  of  a collage  as  in  (4)  and  then  segmented  by 

(7)  „P  = and<surround<and<„P,  „P>>,  ,,P>. 

JH  1 jK  jK 


A new  collage  can  now  be  derived  from  each  existing  collage  minus  the 
multiply-connected  hole  by 
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(8)  „P  = and<comp^ surround- and  TP,  „'P>>> , „P>. 

K — ——  |K  K 

ii  n 

Some  of  the  collages  of  (8)  represent  simply-connected  images  and  others,  multiply- 
connected  images  with  holes  and  other  images  "inside"  them.  These  can  be 
decomposed  using  (4)  and  (5)  and  so  forth. 

5.1.1  Grammar  Model 


The  picture  decomposition  can  be  modelled  by  a context-free  phrase 
structured  grammar. 

VT  = {d,  ?,  .,  ),  (}, 

and  the  production  rules  are: 


P > (DK)? 

P -*•  ? 


D 


• DD 


D ► .D 
3 

K -*  . KK 
3 

K -*■  ,K 
3 

•+  ( IK) 

.K  -*■  .1 

3 3 

.K  -*■  ( . HK) 
3 3 

.K  ->•  .H 
3 3 

.D  -*■  R: 

3 

JWR- 

H - R; 

R -*■  d 


where  P is  the  start  symbol  and  d represents  some  kind  of  descriptor  for  a region. 
Note  how  the  punctuation  marks  label  the  regions  as  being  either  a surround,  an 
image,  or  a hole.  The  string  derivation  for  Figure  2 is: 


P (DK)? 

*(  D D(  I H(  H I))  I)? 
a c b d f g e 


(9)  =»(d:d:  (d.d;  (d;d.))d.)? 


Because  the  grammar  is  context-free  there  exists  a derivation  tree  for 
expression  (9).  Such  a tree  depicts  the  hierarchical  relationship  between  the 
topological  regions  of  the  picture.  That  is  to  say,  from  such  a tree,  or  more 
precisely,  from  a string  of  the  language,  a Hasse  diagram  for  the  subregion 
relation  can  be  derived.  For  example,  if  the  parentheses  are  interpreted  as 
levels  in  the  Hasse  diagram  and  the  descriptors/punctuation  marks  as  topological 
region  labels,  then  the  Hasse  diagram  of  Figure  9 can  be  derived  for  string  (9). 
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Because  of  this  grammar  model,  the  decomposition  can  be  defined  as  the  output 
of  a parser.  The  parser  proceeds  top-down  when  it  is  given  a picture  and 
produces  as  output  a string  of  topological  region  descriptors.  The  parser 
proceeds  bottom-up  when  it  is  given  a string  of  topological  region  descriptors 
and  produces  as  output  a picture.  In  each  case  the  output  is  uniquely  determined 
to  within  an  index  only,  e.g.,  compare  Figure  10  with  Figure  11. 


6.1  REGION  DECOMPOSITION 


The  principles  of  picture  decomposition  can  be  applied  to  region  decomposition 
if  a region  could  be  separated  or  disconnected.  This  separation  would  create 
new  regions  which  could  then  be  hierarchically  related  through  the  subregion 
relation.  Intuitively,  one  could  envision  a simply-connected  region  being 
decomposed  and  related  as  illustrated  in  Figure  12.  Basically,  cells  or  areas  of 
decomposition  are  determined  and  the  region  is  separated  at  these,  thus  forming 
new  regions.  Such  an  operation  will  be  referred  to  as  forming  the  separable 
subregions  of  a region.  These  separable  subregions,  denoted  .R,  will  be  considered 
as  the  topological  subunits  of  a region. 


6.1.1  Skeleton  Graph 


Let  S represent  the  medial  axis  or  internal  skeleton  of  a region  R as 

defined  by  Blum  (14).  Such  a descriptor  is  located  at  the  median,  between  the 

contours  of  a region  and  forms  a connected  graph-like  region  (see  Figure  13). 

The  skeleton  of  a region  can  be  generated  by  successively  removing  certain 

contour  cells  (15).  Let  skel<  P//,_  P>  denote  the  picture  operation  that  executes 

R 

one  removal  of  contour  cells  from  region  R restricted  by  mask  M. . 

Then  the  medial  axis  for  region  R restricted  by  mask  is  given  by 
the  recursive  closure  of  the  skel  operation. 


sp  - *«!<Rr//M  p>. 


1 

The  skeleton  is  made  up  of  branches  and  nodes  N,  the  places  where  three  or  more 

branches  meet.  The  skeleton  will  be  represented  by  a graph  G = (V,E).  The 

vertices  V = {v.,v,,...,v  } are  the  end  cells  of  the  branches  and  the  nodes  N.  The 
1 l n 

edges  E = {e. ,e. , . . . ,e  } are  the  branches  and  each  is  associated  with  two  vertices, 
lz  m 

The  skeleton  graph  will  be  assumed  to  have  no  self-loops  or  parallel  edges,  so 
that  each  branch  of  the  skeleton  can  be  uniquely  associated  with  an  edge.  The 
skeleton  graph  for  a simply-connected  region  has  no  circuits  and  hence  is  a tree. 


6.1.2  Separable  Nodes 

Define  two  special  prints:  the  universal  print  ^P,  pict^<i>  = 1 VicP; 
and  the  empty  print  ^P,  pict  ^<i>  = 0 VieP.  Define  the  picture  operation  of 
growing  a skeleton  cell  j as 

grow <jP/ /RP>  = reachn<j  P//lJP> 

where  n is  the  minimum  y such  that 

and<RP,  reachy<jP//yP>>  ^ or  <RP,  reachy  <^P//yP>>. 

The  growth  of  a set  of  skeleton  cells  X,  is  defined  as 

growth<xP//RP>  = J grow<and<1P,xP>//RP> . 

Two  nodes  n^,n^  N£S  of  a region  R are  nonseparable  if 

and<grow<n  P//RP> , grow<n  P//RP>>  + ^P, 

i.e.,  their  growths  merge.  Generalizing  this,  the  nodes  N£S  of  a region  R 
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can  be  partitioned  by  the  separable  relation 


NL  = sep<n^ ,N//R>  = fn^  : n^  and  n^  are  nonseparable } . 

Thus  all  the  nodes  in  a block  N.  of  the  partition  of  N are  nonseparable,  but  as 
a group  they  create  a set  of  nodes  separable  from  all  other  nodes  of  N.  Each 
is  called  a separable  node. 

6.1.3  Separable  Subregions 

Let  N^£N£S  for  some  region  R,  then  the  separable  subregions  adjacent  to 
are  given  Dy  the  print 

„P  = and<„P,  comp<growth<„  P//„P>>. 

Z R K 

A print  of  a separable  subregion  R can  be  obtained  in  the  usual  fashion  from 

zp  by  1 k 


j 


P = reach<index<remain< Y P//  P>>//  P>. 
K i i z z 


Let  the  part  of  skeleton  associated  with  separable  subregion  .R  be  denoted  .S, 
then  a print  of  ^S  can  be  obtained  by  33 

P = and<  P,  P>. 

3 j 


6.1.4  Primitive  Separable  Subregion 

Each  subregion  ^R  can  be  further  decomposed  into  separable  subregions  by 

selecting  some  separable  node  N^£jS  and  growing  it.  If  an  ^S  has  no  nodes, 

further  decomposition  of  the  subregion  is  not  possible.  This  lack  of  nodes 
provides  a stopping  criterion  for  the  decomposition.  Define  a primitive 
separable  subset  as  a subregion  ^R  whose  associated  skeleton  has  no  nodes. 

Obviously  if  skeleton  S of  region  R has  no  nodes,  then  R itself  is  a primitive 
separable  subregion.  Figure  14  shows  a region  and  its  skelton  that  has  four 
nodes,  and  three  separable  nodes.  The  center  separable  nodes  has  four  separable 
subregions  adjacent  to  it,  the  other  two  separable  nodes  have  three  separable 
subregions  adjacent  to  them.  There  are  a total  of  eight  primitive  separable 
subregions. 


6.1.5  Shared  Separable  Subregion 

Figure  14  shows  a decomposition  where  the  separable  nodes  were  selected 
one  at  a time  or  one-selected;  however,  the  separable  nodes  can  also  be  multiply- 
selected.  For  mutliply-selected  separable  nodes,  some  of  the  separable  subregions 
will  be  adjacent  to  more  than  one  separable  node.  Such  subregions  are  called 
shared  separable  subregions  (see  Figure  15). 


6.1.6  Redundancy 


The  effect  that  these  shared  subregions  haveon  the  results  of  the 
decomposition  is  to  appear  as  a redundant  subregion  in  the  hierarchical  relation- 
ship. If  a region  decomposition  contains  no  shared  separable  subregions  it  is 
called  nonredundant ; otherwise,  it  is  called  redundant.  A redundant  region 
decomposition  in  which  all  shared  separable  subregions  are  primitive  is  called 
primitive  redundant. 


6.1.7  Grammar  Model 


The  region  decomposition  can  be  modelled  by  a context-free  phrase  structured 
grammar. 


VN  = 1R,N,0,E,X}, 

VT  = (e,n, [ , ] }, 

and  the  production  rules  are: 

R - [0] 

R -*•  [N] 

R ->  [Nj 

R -v  [E] 

X - [0] 

X - [0]X 

0 -*  [N]0 

0 ->  [N] 

X ->  [N]X 

X ->  [N] 

N -*■  NjX 

N . -*•  n 
x 

X ■>  [E]X 
X > [E] 

E -*■  e 

R is  the  start  symbol;  while  e represents  a descriptor  for  an  edge,  i.e.,  a 
primitive  separable  subregion,  and  n represents  a descriptor  for  a separable 
node.  The  symbol  0 stands  for  a pseudo  node.  A pseudo  node  is  a hypothetical 
node  that  enables  mutliply-selected  separable  nodes  to  have  a common  ancestor. 

The  string  derivation  for  Figure  10  is: 

g to] 

* [[N.[E][E][Ni[E][E][E]]lNi[E][E][NitE][E][El]]] 

(10)  £ l [n1[e1][e2]  [n3[e3]  [e4]  [e6]  ] Jn^e,.]  [e2]  [n^J  [e^]  [e6]  ] ] ] . 

Because  the  grammar  is  context-free  there  exists  a derivation  tree  for 
string  (10)  which  depicts  the  hierarchical  relationship  between  the  separable 
subregions  of  the  region.  For  example,  if  the  brackets  are  interpreted  as  levels 
in  the  Hasse  diagram,  and  the  edge  descriptors  as  separable  subregion  labels, 
then  the  Hasse  diagram  of  Figure  16  can  be  derived  for  string  (10). 

Also  because  of  this  grammar  model,  the  decomposition  can  be  defined  as  the 
output  of  a parser,  just  as  was  done  for  the  picture  decomposition.  Once  again 
the  output  of  the  parser  is  uniquely  determined  to  within  an  index  only. 
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7.1  VERTEX  ATTRIBUTES 


The  hierarchical  relationship  for  the  separable  subregions  of  a region  is 
completely  dependent  on  the  order  in  which  the  separable  nodes  of  the  skeleton 
graph  are  selected,  and  whether  they  are  one-  or  mutl iply-selected . To  guide 
the  selection  of  separable  nodes,  the  different  characteristics  of  a skeleton 
graph  must  be  sorted  out  so  that  the  cause  and  effect  relationship  between 
vertices  and  edges  can  be  identified. 

The  syntax  of  a graph  characterizes  the  interconnection  of  the  vertices 
and  edges.  The  semantics  characterizes  the  size  and  shape  of  the  edges  and 
vertices.  A graph  with  the  same  syntax  can  have  infinitely  many  semantic 
characterizations.  The  semantics  of  a particular  graph  can  be  defined  by 
associating  attributes  with  each  edge  and  vertex.  The  attributes  can  be 
thought  of  as  functions  whose  values  are  assigned  depending  on  the  meaning  that 
the  vertex  or  edge  has  for  a particular  graph. 

For  selecting  nodes,  the  vertex  attributes  will  be  used.  Two  types  of 
vertex  attributes  have  been  recognized  as  aids  in  giving  a "meaningful"  ordering 
for  the  nodes  of  a skeleton:  cluster  and  selection.  A cluster  attribute  is  one 
that  characterizes  a local  property  of  a vertex.  The  concept  of  vertex  clustering 
is  based  on  pattern  recognition  work  which  is  interested  in  grouping  points 
using  a local  optimization  criterion  (16).  A selection  attribute  is  one  that 
characterizes  a global  property  of  a vertex.  The  concept  of  vertex  selection 
is  based  on  operational  research  work  which  is  interested  in  choosing  points 
using  a global  optimization  criterion  (17). 

7.1.1  Cluster  Attributes 


For  a vertex  v^  define  the  following  useful  cluster  attributes: 


1) 

2) 

3) 


degree<v^>  = n where  n is  the  number  of  edges  incident  on  v^; 

si 

size<v^>  = s^  where  grow<^  P//RP>  = reach  <v  P//jjP>; 


edge<v1>  = •"’ik*  " * ,min}  where  vj  ’ 

adjacent  to  v.  and  m . , = metric<vJ  v >. 

i ij  i * j 


v are  all  the  vertices 
n 


Given  a cluster  attribute,  a cluster  relation  can  be  defined.  A cluster 
relation  for  the  a^  attribute  on  a connected  graph  (V,E)  about  vertex  v^, 

denoted  a^-cluster<v^, (V,E)>,  is  defined  as  the  set  of  all  connected  vertices  from 

vertex  that  have  an  equivalent  a^  attribute  value. 


Because  the  cluster  relation  is  an  equivalence  relation,  each  a^  attribute 
forms  a partition  of  the  vertices  V.  Denote  such  a partition  as  $a  . A set 

of  such  partitions  {<£  , $ ,...,$  } forms  a partial  ordered  set  under  the  refinement 

3i  SL 

12  n 

relation  (18).  Thus  the  logical  combination  (conjunction  and  disjunction)  of 

cluster  attributes  can  be  defined  from  the  greatest  lower  bound  and  the  least 

upper  bound  operations  on  this  partial  ordered  set.  For  example,  a partition 

of  all  vertices  with  equal  degree  and  equal  size  can  be  formed  from  the  least 

upper  bound  of  partitions  and  . 

degree  size 
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7.1.2  Selection  Attributes 


For  a vertex  v^,  define  the  following  useful  selection  attributes: 

1)  eccen<Vj>  = max<s^  * tru^>; 

2)  weight<v^>  = Is^  * m„ ; 

where  1 j < cd<V>  and  c<i  is  the  cardinality  of  the  set. 

Given  a selection  attribute,  a select  relation  can  be  defined.  A select 
relation  for  the  a^  attribute  on  a connected  graph  (V,E),  denoted  a^-select< (V,E)> , 

is  defined  as  the  set  of  all  vertices  that  have  the  minimal  a^  attribute  value. 

For  the  special  case  when  all  s.  are  equal,  the  minimum  eccentricity  corres- 
ponds to  the  center  of  the  graph  and  the  minimum  weight  corresponds  to  the  median 
of  the  graph  (19).  The  definitions  as  given  here  are  dubbed  the  weighted  center 
and  the  weighted  median,  respectively. 

7.1.3  Simply-Connected  Regions 

Insight  into  the  relationship  between  separable  nodes,  separable  subregions, 

and  vertex  attributes  can  be  gained  through  the  following  theorems.  Denote 

N as  the  set  of  nodes  selected  by  a select  relation;  N as  the  set  of  nodes 
s cs 

generated  by  forming  clusters  about  each  node  of  N ; and  an  n-selected  separable 

node  as  either  a one-  or  a multiply-selected  separable  node. 

Lemma  1 ; 

If  the  set  N^s  is  connected,  then  the  set  of  all  n-selected  separable 
nodes  generated  from  Ncg  is  connected. 

Lemma  2 : 

If  the  set  N is  connected,  then  the  set  N is  connected, 
s cs 

Theorem  1: 

If  each  N of  a redundant  decomposition  of  a simply-connected  region  is 
connected,  then  the  decomposition  is  primitive  redundant. 

Lemma  3: 

If  two  weighted  centers  (weighted  medians)  of  a tree  T = (V,E)  are  non- 
adjacent,  then  all  the  vertices  in  the  path  connecting  the  two  weighted 
centers  (weighted  medians)  cannot  be  weighted  centers  (weighted  medians). 

Lemma  4 : 

If  two  weighted  centers  (weighted  medians)  of  a tree  T = (V,E)  are  nonadjacent, 
then  at  least  one  vertex  in  the  path  connecting  the  two  weighted  centers 
(weighted  medians)  must  be  a weighted  center  (weighted  median). 

Theorem  2: 

Every  tree  T = (V,E)  has  only  one  or  two  weighted  centers  (weighted  medians) 
and  the  two,  if  they  exist,  must  be  adjacent,  hence  connected. 
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Theorem  3: 

If  the  weighted  center  or  weighted  median  is  used  as  a selection  attribute, 
any  redundant  decomposition  of  a simply-connected  region  will  be  primitive 
redundant . 


8.1  1 Ml’  LKMKNTAT  I ON 


The  picture  operations  were  implemented  on  a binary  array  processor  as 
proposed  by  Breeding  (9).  For  this  work,  the  proposed  hardware  array  processor 
was  simulated  by  the  software  system  dubbed  GAPS  (General  Array  Processor 
Simulator).  Besides  the  picture  operations  already  mentioned,  operations  to 
smooth  contours,  fill  gaps,  and  remove  salt-and-pepper  noise  were  also  defined. 

In  addition,  tasks  which  use  the  array  processor  and  conventional  arithmetic 
computation  were  defined  to  prune  the  branches  of  the  noisy  skeleton,  and  to 
screen  out  regions  which  were  considered  too  small  to  be  topologically  relevant. 

For  the  special  case  of  a simply-connected  region  when  no  cluster  relations 
are  defined  and  the  selection  attribute  is  the  center  (i.e. , all  s^  are  assumed 
equal),  the  decomposition  can  be  completely  implemented  by  picture  operations. 
Such  picture  operations  were  used  on  the  aircraft  decomposition  suggested  by 
Breeding.  A print  of  an  aircraft  was  first  parsed  in  a top-down  fashion.  The 
resulting  string  description  was  then  analyzed  to  extract  and  label  the  separable 
subregions  corresponding  to  the  wings,  the  nose  section,  and  the  tail  section. 
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9.1  SUMMARY  AND  CONCIAIS TONS 


In  the  past,  the  decomposition  of  patterns  has  been  viewed  as  two  apparently 
distinct  steps:  1)  decomposition  into  topological  units;  2)  decomposition  of 
each  unit  into  "primitives".  Step  1 was  realized  either  by  the  classical  method 
of  contour  following  (22),  or  by  the  more  recent  development  of  heuristic  analysis 
of  the  contours  (23,24).  Step  2 was  realized  either  by  restricting  the  units  to 
line  drawings  (25,26),  by  approximating  the  units'  contours  by  straight  lines 
(27,28),  or  by  approximating  the  units  by  convex  polygons  (29-32). 

In  contrast,  the  decomposition  described  here  considers  these  two  steps  to 
be  related.  The  pattern  was  first  considered  as  a fixed-size  set  of  picture 
cells,  then  its  topological  units  (regions)  were  defined  and  hierarchically 
related.  These  concepts  were  then  extended  by  defining  topological  subunits 
(separable  subregions)  of  a region.  These  subregions  were  then  related  using 
the  same  hierarchical  relation  as  before. 

In  comparison,  the  picture  decomposition  described  here  is  conceptually 
similar  to  Brice  and  Fennema's  region  analysis  (24).  Brice  and  Fennema  deal 
with  general  gray  level  pictures,  and  approximate  the  region  contours  by  straight 
lines  so  heuristic  analysis  can  be  used  to  label  the  regions.  In  contrast 
this  research  applies  to  binary-valued  pictures  only,  does  not  approximate 
the  contours,  and  uses  a nonheuristic  method  of  labelling  the  regions. 

In  comparison,  the  region  decomposition  described  here  is  conceptually 
similar  to  Pavlidis  formation  of  convex  subsets  (29,30).  Pavlidis  deals  with 
regions  approximated  by  polygons,  and  generates  a set  of  overlapping  polygons 
called  primary  subsets  whose  union  gives  the  polygon  region  (see  Figure  17). 

The  generation  of  the  region's  half-plane  extensions  is  necessary  to  locate  the 
primary  subsets.  The  areas  where  the  primary  subsets  overlap  are  called  the 
nuclei.  A graph  where  the  nuclei  and  primary  subsets  are  vertices,  and  the  edges 
denote  an  intersection  relationship,  can  be  derived  to  represent  the  region.  This 
graph  is  not  dependent  on  the  order  in  which  the  nuclei  of  the  region  are  deter- 
mined. The  graph  is  a web  and  the  decomposition  of  the  region  it  represents  can 
be  modelled  by  a context-sensitive  web  grammar.  In  contrast  this  research  does 
not  approximate  the  contours,  and  generates  a set  of  primitive  separable  sub- 
region  whose  union  does  not  give  the  region.  The  generation  of  the  region's 
skeleton  is  necessary  to  locate  the  primitive  separable  subregions.  A graph 
where  the  separable  nodes  and  primitive  separable  subregions  are  vertices,  and 
the  edges  denote  a hierarchical  relationship,  can  be  derived  to  represent  the 
region.  This  graph  is  dependent  on  the  order  in  which  the  separable  nodes  are 
selected.  The  graph  is  a tree  and  the  decomposition  which  it  represents  can  be 
modelled  by  a context-free  phrase  structured  grammar.  Interestingly  enough, 
a separable  node  always  corresponds  to  a nuclei;  however,  not  every  nuclei 
corresponds  to  a separable  node,  e.g.,  compare  Figures  17  and  18  to  Figure  19. 

Ignoring  for  the  moment  the  appropriateness  of  using  a hierarchical 
relation  for  labelling  regions  and  subregions,  two  conclusions  about  this  research 
can  be  made.  First,  the  fact  that  the  picture  decomposition  is  topologically- 
oriented  and  can  be  implemented  by  local  parallel  operations,  indicates  it 
may  be  mimicking  certain  human-perception  tasks  suggested  by  psychologists  (2). 
Second,  the  fact  that  the  region  decomposition  can  invariantly  extract  the 
protrusions  (separable  subregions)  of  an  object  indicates  that  it  may  be 
a valuable  tool  for  defining  the  elusive  "primitives"  needed  for  syntactic 
recognition  (33).  In  addition,  because  the  formation  of  separable  subregions 
can  be  implemented  by  local  parallel  operations,  it  may  be  mimicking  certain 
feature/primitive  extraction  capabilities  observed  in  humans. 
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connectu<x . //X> 


connect  <x . //X> 


connect ^<x . //X> 


connect  <x.//X> 


Figure  2.  connect  Relation  on  a Square  Tessellation  Using 
Moore  Neighborhood  to  Define  Adjacency 
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Figure  3.  Region i Border  Cells  and  Non-border  Cells 
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Figure  4.  Simply-Connected  Region  and  Contour 
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Figure  5.  Multiply-Connected  Region  and  Contours 
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= develop  <Pft> 
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Figure  8.  Use  of  Print  Operations 


Figure  9.  Hasse  Diagram  for  String  (9) 
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Figure  10.  Decomposition  Using  Forward  Raster  Index 


Figure  12.  Decomposition  of  a Simply-Connected  Region 
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e3  = <vltV2> 
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Figure  13.  Skeleton  and  Graph  of  a Region 
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Figure  14*  Separable  Nodes  and  Subregions 
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Figure  15.  Shared  Separable  Subregions 
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Figure  17.  Decomposition  Using  Primary  Subsets  and  Nuclei 
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Figure  18.  Decomposition  Using  Separable  Subregions  and 
Separable  Nodes 
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